Philippine Language Resources: Applications, Issues, and Directions

نویسندگان

  • Nathaniel Oco
  • Leif Romeritch Syliongka
  • Tod Allman
  • Rachel E. O. Roxas
چکیده

In this paper, we present our collective effort to gather, annotate, and model various language resources for use in different research projects. This includes those that are available online such as tweets, Wikipedia articles, game chat, online radio, and religious text. The different applications, issues and directions are also discussed in the paper. Future works include developing a language web service. A subset of the resources will be made temporarily available online at: http://bit.ly/1MpcFoT.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Philippine Language Resources: Trends and Directions

We present the diverse research activities on Philippine languages from all over the country, with focus on the Center for Language Technologies of the College of Computer Studies, De La Salle University, Manila, where majority of the work are conducted. These projects include the formal representation of Philippine languages and the processes involving these languages. Language representation ...

متن کامل

e-Wika: Digitalization of Philippine Language

In this paper, we present what we have attempted towards the digitalization of the Philippine languages and their respective applications, and what we intend to do in the future. We present the development of a multi-engine bi-directional English-Filipino Machine Translation (MT) system, and the building of various language resources and tools for this system. We also discuss our experiments on...

متن کامل

e-Wika: Philippine Connectivity through Language

In this paper, we present what we have attempted towards connecting the Philippine islands through the digitalization of the Philippine languages and their respective applications, and what we intend to do in the future. We present the development of a multi-engine bi-directional English-Filipino Machine Translation (MT) system, and the building of various language resources and tools for this ...

متن کامل

Philippine Languages Online Corpora: Status, issues, and prospects

This paper presents the work being done so far on the building of online corpus for Philippine languages. As for the status, the Philippine Languages Online Corpora (PLOC) now boasts a 250,000-word written corpus of the eight major languages in the archipelago. Some of the issues confronting the corpus building and future directions for this project are likewise discussed in this paper.

متن کامل

Constituent Structure for Filipino: Induction through Probabilistic Approaches

The current state of Philippine linguistic resources, which includes formal grammars, electronic dictionaries and corpora are not yet significant to address industrialstrength language technologies. This paper discusses a computational approach in automatically estimating constituent structures from a corpus using unsupervised probabilistic approaches. Two models are presented and results show ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016